Goto

Collaborating Authors

 robotic and automation


DynaNav: Dynamic Feature and Layer Selection for Efficient Visual Navigation

Neural Information Processing Systems

Visual navigation is essential for robotics and embodied AI. However, existing foundation models, particularly those with transformer decoders, suffer from high computational overhead and lack interpretability, limiting their deployment in resource-tight scenarios. To address this, we propose DynaNav, a Dynamic Visual Navigation framework that adapts feature and layer selection based on scene complexity. It employs a trainable hard feature selector for sparse operations, enhancing efficiency and interpretability. Additionally, we integrate feature selection into an early-exit mechanism, with Bayesian Optimization determining optimal exit thresholds to reduce computational cost. Extensive experiments in real-world-based datasets and simulated environments demonstrate the effectiveness of DynaNav. Compared to ViNT, DynaNav achieves a 2.26 reduction in FLOPs, 42.3% lower inference time, and 32.8% lower memory usage, while improving navigation performance across four public datasets.


Grasp2Grasp: Vision-Based Dexterous Grasp Translation via Schrรถdinger Bridges

Neural Information Processing Systems

We propose a new approach to vision-based dexterous grasp translation, which aims to transfer grasp intent across robotic hands with differing morphologies. Given a visual observation of a source hand grasping an object, our goal is to synthesize a functionally equivalent grasp for a target hand without requiring paired demonstrations or hand-specific simulations.



Habitat 2.0: Training Home Assistants to Rearrange their Habitat

Neural Information Processing Systems

We introduce Habitat 2.0 (H2.0), a simulation platform for training virtual robots in interactive 3D environments and complex physics-enabled scenarios. We make comprehensive contributions to all levels of the embodied AI stack - data, simulation, and benchmark tasks.


Robot Talk Episode 145 โ€“ Robotics and automation in manufacturing, with Agata Suwala

Robohub

Claire chatted to Agata Suwala from the Manufacturing Technology Centre about leveraging robotics to make manufacturing systems more sustainable. Agata Suwala is a Technology Manager at the Manufacturing Technology Centre, where she leads cutting-edge work in automation and robotics. With over a decade of experience in R&D, Agata specialises in developing and implementing advanced manufacturing systems--particularly for the aerospace sector--transforming complex, skill-intensive processes through automation. Her recent focus is on enabling the transition to a circular economy by leveraging automation and robotics to create sustainable, scalable technologies. Robot Talk is a weekly podcast that explores the exciting world of robotics, artificial intelligence and autonomous machines.



A High-Resolution Dataset for Instance Detection with Multi-View Instance Capture

Neural Information Processing Systems

One major reason is that current InsDet datasets are too small in scale by today's standards. For example, the popular InsDet dataset GMU (published in 2016) has only 23 instances, far less than COCO (80 classes), a well-known object detection dataset published in 2014.